Dataset statistics
| Number of variables | 9 |
|---|---|
| Number of observations | 392 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 27.7 KiB |
| Average record size in memory | 72.3 B |
Variable types
| NUM | 8 |
|---|---|
| CAT | 1 |
Reproduction
| Analysis started | 2020-08-25 01:12:55.812310 |
|---|---|
| Analysis finished | 2020-08-25 01:13:06.382340 |
| Duration | 10.57 seconds |
| Version | pandas-profiling v2.8.0 |
| Command line | pandas_profiling --config_file config.yaml [YOUR_FILE.csv] |
| Download configuration | config.yaml |
cubicInches is highly correlated with cylinders and 1 other fields | High correlation |
cylinders is highly correlated with cubicInches | High correlation |
weightLbs is highly correlated with cubicInches | High correlation |
brand has 27 (6.9%) zeros | Zeros |
MPG
Real number (ℝ≥0)
| Distinct count | 127 |
|---|---|
| Unique (%) | 32.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 23.445918367346938 |
|---|---|
| Minimum | 9.0 |
| Maximum | 46.6 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 3.2 KiB |
Quantile statistics
| Minimum | 9 |
|---|---|
| 5-th percentile | 13 |
| Q1 | 17 |
| median | 22.75 |
| Q3 | 29 |
| 95-th percentile | 37 |
| Maximum | 46.6 |
| Range | 37.6 |
| Interquartile range (IQR) | 12 |
Descriptive statistics
| Standard deviation | 7.805007487 |
|---|---|
| Coefficient of variation (CV) | 0.3328940826 |
| Kurtosis | -0.5159934946 |
| Mean | 23.44591837 |
| Median Absolute Deviation (MAD) | 5.8 |
| Skewness | 0.4570923231 |
| Sum | 9190.8 |
| Variance | 60.91814187 |
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 13 | 20 | 5.1% | |
| 14 | 19 | 4.8% | |
| 18 | 17 | 4.3% | |
| 15 | 16 | 4.1% | |
| 26 | 14 | 3.6% | |
| 16 | 13 | 3.3% | |
| 19 | 12 | 3.1% | |
| 24 | 11 | 2.8% | |
| 28 | 10 | 2.6% | |
| 25 | 10 | 2.6% | |
| 22 | 10 | 2.6% | |
| 27 | 9 | 2.3% | |
| 23 | 9 | 2.3% | |
| 20 | 9 | 2.3% | |
| 29 | 8 | 2.0% | |
| 31 | 7 | 1.8% | |
| 17 | 7 | 1.8% | |
| 21 | 7 | 1.8% | |
| 30 | 7 | 1.8% | |
| 36 | 6 | 1.5% | |
| 12 | 6 | 1.5% | |
| 32 | 6 | 1.5% | |
| 17.5 | 5 | 1.3% | |
| 15.5 | 5 | 1.3% | |
| 20.2 | 4 | 1.0% | |
| Other values (102) | 145 | 37.0% |
| Value | Count | Frequency (%) | |
| 9 | 1 | 0.3% | |
| 10 | 2 | 0.5% | |
| 11 | 4 | 1.0% | |
| 12 | 6 | 1.5% | |
| 13 | 20 | 5.1% | |
| 14 | 19 | 4.8% | |
| 14.5 | 1 | 0.3% | |
| 15 | 16 | 4.1% | |
| 15.5 | 5 | 1.3% | |
| 16 | 13 | 3.3% |
| Value | Count | Frequency (%) | |
| 46.6 | 1 | 0.3% | |
| 44.6 | 1 | 0.3% | |
| 44.3 | 1 | 0.3% | |
| 44 | 1 | 0.3% | |
| 43.4 | 1 | 0.3% | |
| 43.1 | 1 | 0.3% | |
| 41.5 | 1 | 0.3% | |
| 40.8 | 1 | 0.3% | |
| 39.4 | 1 | 0.3% | |
| 39.1 | 1 | 0.3% |
| Distinct count | 5 |
|---|---|
| Unique (%) | 1.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5.471938775510204 |
|---|---|
| Minimum | 3.0 |
| Maximum | 8.0 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 3.2 KiB |
Quantile statistics
| Minimum | 3 |
|---|---|
| 5-th percentile | 4 |
| Q1 | 4 |
| median | 4 |
| Q3 | 8 |
| 95-th percentile | 8 |
| Maximum | 8 |
| Range | 5 |
| Interquartile range (IQR) | 4 |
Descriptive statistics
| Standard deviation | 1.705783247 |
|---|---|
| Coefficient of variation (CV) | 0.3117328825 |
| Kurtosis | -1.398198638 |
| Mean | 5.471938776 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 0.5081092403 |
| Sum | 2145 |
| Variance | 2.909696487 |
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 4 | 199 | 50.8% | |
| 8 | 103 | 26.3% | |
| 6 | 83 | 21.2% | |
| 3 | 4 | 1.0% | |
| 5 | 3 | 0.8% |
| Value | Count | Frequency (%) | |
| 3 | 4 | 1.0% | |
| 4 | 199 | 50.8% | |
| 5 | 3 | 0.8% | |
| 6 | 83 | 21.2% | |
| 8 | 103 | 26.3% |
| Value | Count | Frequency (%) | |
| 8 | 103 | 26.3% | |
| 6 | 83 | 21.2% | |
| 5 | 3 | 0.8% | |
| 4 | 199 | 50.8% | |
| 3 | 4 | 1.0% |
| Distinct count | 80 |
|---|---|
| Unique (%) | 20.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 194.41326530612244 |
|---|---|
| Minimum | 68.0 |
| Maximum | 455.0 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 3.2 KiB |
Quantile statistics
| Minimum | 68 |
|---|---|
| 5-th percentile | 85 |
| Q1 | 105 |
| median | 151 |
| Q3 | 275.75 |
| 95-th percentile | 400 |
| Maximum | 455 |
| Range | 387 |
| Interquartile range (IQR) | 170.75 |
Descriptive statistics
| Standard deviation | 104.6428227 |
|---|---|
| Coefficient of variation (CV) | 0.5382493962 |
| Kurtosis | -0.7782892557 |
| Mean | 194.4132653 |
| Median Absolute Deviation (MAD) | 61 |
| Skewness | 0.7016875496 |
| Sum | 76210 |
| Variance | 10950.12034 |
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 97 | 21 | 5.4% | |
| 350 | 18 | 4.6% | |
| 98 | 18 | 4.6% | |
| 250 | 17 | 4.3% | |
| 318 | 17 | 4.3% | |
| 140 | 15 | 3.8% | |
| 400 | 13 | 3.3% | |
| 225 | 13 | 3.3% | |
| 91 | 12 | 3.1% | |
| 232 | 11 | 2.8% | |
| 302 | 11 | 2.8% | |
| 121 | 11 | 2.8% | |
| 151 | 9 | 2.3% | |
| 120 | 9 | 2.3% | |
| 351 | 8 | 2.0% | |
| 231 | 8 | 2.0% | |
| 90 | 8 | 2.0% | |
| 200 | 7 | 1.8% | |
| 105 | 7 | 1.8% | |
| 85 | 7 | 1.8% | |
| 304 | 7 | 1.8% | |
| 122 | 7 | 1.8% | |
| 79 | 6 | 1.5% | |
| 119 | 6 | 1.5% | |
| 156 | 6 | 1.5% | |
| Other values (55) | 120 | 30.6% |
| Value | Count | Frequency (%) | |
| 68 | 1 | 0.3% | |
| 70 | 3 | 0.8% | |
| 71 | 2 | 0.5% | |
| 72 | 1 | 0.3% | |
| 76 | 1 | 0.3% | |
| 78 | 1 | 0.3% | |
| 79 | 6 | 1.5% | |
| 80 | 1 | 0.3% | |
| 81 | 1 | 0.3% | |
| 83 | 1 | 0.3% |
| Value | Count | Frequency (%) | |
| 455 | 3 | 0.8% | |
| 454 | 1 | 0.3% | |
| 440 | 2 | 0.5% | |
| 429 | 3 | 0.8% | |
| 400 | 13 | 3.3% | |
| 390 | 1 | 0.3% | |
| 383 | 2 | 0.5% | |
| 360 | 4 | 1.0% | |
| 351 | 8 | 2.0% | |
| 350 | 18 | 4.6% |
horsepower
Real number (ℝ≥0)
| Distinct count | 93 |
|---|---|
| Unique (%) | 23.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 104.46938775510205 |
|---|---|
| Minimum | 46.0 |
| Maximum | 230.0 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 3.2 KiB |
Quantile statistics
| Minimum | 46 |
|---|---|
| 5-th percentile | 60.55 |
| Q1 | 75 |
| median | 93.5 |
| Q3 | 126 |
| 95-th percentile | 180 |
| Maximum | 230 |
| Range | 184 |
| Interquartile range (IQR) | 51 |
Descriptive statistics
| Standard deviation | 38.49115993 |
|---|---|
| Coefficient of variation (CV) | 0.3684443908 |
| Kurtosis | 0.6969469997 |
| Mean | 104.4693878 |
| Median Absolute Deviation (MAD) | 19.5 |
| Skewness | 1.087326282 |
| Sum | 40952 |
| Variance | 1481.569393 |
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 150 | 22 | 5.6% | |
| 90 | 20 | 5.1% | |
| 88 | 19 | 4.8% | |
| 110 | 18 | 4.6% | |
| 100 | 17 | 4.3% | |
| 75 | 14 | 3.6% | |
| 95 | 14 | 3.6% | |
| 105 | 12 | 3.1% | |
| 67 | 12 | 3.1% | |
| 70 | 12 | 3.1% | |
| 65 | 10 | 2.6% | |
| 97 | 9 | 2.3% | |
| 85 | 9 | 2.3% | |
| 80 | 7 | 1.8% | |
| 145 | 7 | 1.8% | |
| 140 | 7 | 1.8% | |
| 72 | 6 | 1.5% | |
| 92 | 6 | 1.5% | |
| 78 | 6 | 1.5% | |
| 68 | 6 | 1.5% | |
| 84 | 6 | 1.5% | |
| 180 | 5 | 1.3% | |
| 115 | 5 | 1.3% | |
| 60 | 5 | 1.3% | |
| 71 | 5 | 1.3% | |
| Other values (68) | 133 | 33.9% |
| Value | Count | Frequency (%) | |
| 46 | 2 | 0.5% | |
| 48 | 3 | 0.8% | |
| 49 | 1 | 0.3% | |
| 52 | 4 | 1.0% | |
| 53 | 2 | 0.5% | |
| 54 | 1 | 0.3% | |
| 58 | 2 | 0.5% | |
| 60 | 5 | 1.3% | |
| 61 | 1 | 0.3% | |
| 62 | 2 | 0.5% |
| Value | Count | Frequency (%) | |
| 230 | 1 | 0.3% | |
| 225 | 3 | 0.8% | |
| 220 | 1 | 0.3% | |
| 215 | 3 | 0.8% | |
| 210 | 1 | 0.3% | |
| 208 | 1 | 0.3% | |
| 200 | 1 | 0.3% | |
| 198 | 2 | 0.5% | |
| 193 | 1 | 0.3% | |
| 190 | 3 | 0.8% |
| Distinct count | 346 |
|---|---|
| Unique (%) | 88.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2977.5841836734694 |
|---|---|
| Minimum | 1613.0 |
| Maximum | 5140.0 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 3.2 KiB |
Quantile statistics
| Minimum | 1613 |
|---|---|
| 5-th percentile | 1931.6 |
| Q1 | 2225.25 |
| median | 2803.5 |
| Q3 | 3614.75 |
| 95-th percentile | 4464 |
| Maximum | 5140 |
| Range | 3527 |
| Interquartile range (IQR) | 1389.5 |
Descriptive statistics
| Standard deviation | 849.40256 |
|---|---|
| Coefficient of variation (CV) | 0.2852656743 |
| Kurtosis | -0.8092593883 |
| Mean | 2977.584184 |
| Median Absolute Deviation (MAD) | 639.5 |
| Skewness | 0.5195856741 |
| Sum | 1167213 |
| Variance | 721484.709 |
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 2130 | 4 | 1.0% | |
| 1985 | 4 | 1.0% | |
| 2125 | 3 | 0.8% | |
| 2945 | 3 | 0.8% | |
| 2265 | 3 | 0.8% | |
| 2720 | 3 | 0.8% | |
| 2155 | 3 | 0.8% | |
| 2300 | 3 | 0.8% | |
| 1950 | 2 | 0.5% | |
| 3940 | 2 | 0.5% | |
| 2930 | 2 | 0.5% | |
| 1937 | 2 | 0.5% | |
| 3425 | 2 | 0.5% | |
| 2110 | 2 | 0.5% | |
| 2065 | 2 | 0.5% | |
| 2408 | 2 | 0.5% | |
| 3672 | 2 | 0.5% | |
| 3725 | 2 | 0.5% | |
| 1825 | 2 | 0.5% | |
| 1990 | 2 | 0.5% | |
| 2670 | 2 | 0.5% | |
| 1965 | 2 | 0.5% | |
| 2045 | 2 | 0.5% | |
| 2164 | 2 | 0.5% | |
| 3410 | 2 | 0.5% | |
| Other values (321) | 332 | 84.7% |
| Value | Count | Frequency (%) | |
| 1613 | 1 | 0.3% | |
| 1649 | 1 | 0.3% | |
| 1755 | 1 | 0.3% | |
| 1760 | 1 | 0.3% | |
| 1773 | 1 | 0.3% | |
| 1795 | 2 | 0.5% | |
| 1800 | 2 | 0.5% | |
| 1825 | 2 | 0.5% | |
| 1834 | 1 | 0.3% | |
| 1835 | 1 | 0.3% |
| Value | Count | Frequency (%) | |
| 5140 | 1 | 0.3% | |
| 4997 | 1 | 0.3% | |
| 4955 | 1 | 0.3% | |
| 4952 | 1 | 0.3% | |
| 4951 | 1 | 0.3% | |
| 4906 | 1 | 0.3% | |
| 4746 | 1 | 0.3% | |
| 4735 | 1 | 0.3% | |
| 4732 | 1 | 0.3% | |
| 4699 | 1 | 0.3% |
time-to-sixty
Real number (ℝ≥0)
| Distinct count | 17 |
|---|---|
| Unique (%) | 4.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 15.681122448979592 |
|---|---|
| Minimum | 8.0 |
| Maximum | 25.0 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 3.2 KiB |
Quantile statistics
| Minimum | 8 |
|---|---|
| 5-th percentile | 11 |
| Q1 | 14 |
| median | 16 |
| Q3 | 17 |
| 95-th percentile | 20 |
| Maximum | 25 |
| Range | 17 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 2.761231566 |
|---|---|
| Coefficient of variation (CV) | 0.1760863468 |
| Kurtosis | 0.5026648441 |
| Mean | 15.68112245 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 0.3030136101 |
| Sum | 6147 |
| Variance | 7.62439976 |
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 16 | 64 | 16.3% | |
| 15 | 64 | 16.3% | |
| 14 | 49 | 12.5% | |
| 17 | 47 | 12.0% | |
| 13 | 35 | 8.9% | |
| 19 | 29 | 7.4% | |
| 18 | 29 | 7.4% | |
| 12 | 21 | 5.4% | |
| 11 | 13 | 3.3% | |
| 20 | 12 | 3.1% | |
| 21 | 8 | 2.0% | |
| 22 | 7 | 1.8% | |
| 10 | 6 | 1.5% | |
| 9 | 3 | 0.8% | |
| 24 | 2 | 0.5% | |
| 25 | 2 | 0.5% | |
| 8 | 1 | 0.3% |
| Value | Count | Frequency (%) | |
| 8 | 1 | 0.3% | |
| 9 | 3 | 0.8% | |
| 10 | 6 | 1.5% | |
| 11 | 13 | 3.3% | |
| 12 | 21 | 5.4% | |
| 13 | 35 | 8.9% | |
| 14 | 49 | 12.5% | |
| 15 | 64 | 16.3% | |
| 16 | 64 | 16.3% | |
| 17 | 47 | 12.0% |
| Value | Count | Frequency (%) | |
| 25 | 2 | 0.5% | |
| 24 | 2 | 0.5% | |
| 22 | 7 | 1.8% | |
| 21 | 8 | 2.0% | |
| 20 | 12 | 3.1% | |
| 19 | 29 | 7.4% | |
| 18 | 29 | 7.4% | |
| 17 | 47 | 12.0% | |
| 16 | 64 | 16.3% | |
| 15 | 64 | 16.3% |
year
Real number (ℝ≥0)
| Distinct count | 13 |
|---|---|
| Unique (%) | 3.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1976.9795918367347 |
|---|---|
| Minimum | 1971.0 |
| Maximum | 1983.0 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 3.2 KiB |
Quantile statistics
| Minimum | 1971 |
|---|---|
| 5-th percentile | 1971 |
| Q1 | 1974 |
| median | 1977 |
| Q3 | 1980 |
| 95-th percentile | 1983 |
| Maximum | 1983 |
| Range | 12 |
| Interquartile range (IQR) | 6 |
Descriptive statistics
| Standard deviation | 3.683736544 |
|---|---|
| Coefficient of variation (CV) | 0.001863315412 |
| Kurtosis | -1.16744622 |
| Mean | 1976.979592 |
| Median Absolute Deviation (MAD) | 3 |
| Skewness | 0.01968829963 |
| Sum | 774976 |
| Variance | 13.56991492 |
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 1974 | 40 | 10.2% | |
| 1979 | 36 | 9.2% | |
| 1977 | 34 | 8.7% | |
| 1983 | 30 | 7.7% | |
| 1976 | 30 | 7.7% | |
| 1980 | 29 | 7.4% | |
| 1971 | 29 | 7.4% | |
| 1982 | 28 | 7.1% | |
| 1973 | 28 | 7.1% | |
| 1978 | 28 | 7.1% | |
| 1981 | 27 | 6.9% | |
| 1972 | 27 | 6.9% | |
| 1975 | 26 | 6.6% |
| Value | Count | Frequency (%) | |
| 1971 | 29 | 7.4% | |
| 1972 | 27 | 6.9% | |
| 1973 | 28 | 7.1% | |
| 1974 | 40 | 10.2% | |
| 1975 | 26 | 6.6% | |
| 1976 | 30 | 7.7% | |
| 1977 | 34 | 8.7% | |
| 1978 | 28 | 7.1% | |
| 1979 | 36 | 9.2% | |
| 1980 | 29 | 7.4% |
| Value | Count | Frequency (%) | |
| 1983 | 30 | 7.7% | |
| 1982 | 28 | 7.1% | |
| 1981 | 27 | 6.9% | |
| 1980 | 29 | 7.4% | |
| 1979 | 36 | 9.2% | |
| 1978 | 28 | 7.1% | |
| 1977 | 34 | 8.7% | |
| 1976 | 30 | 7.7% | |
| 1975 | 26 | 6.6% | |
| 1974 | 40 | 10.2% |
| Distinct count | 30 |
|---|---|
| Unique (%) | 7.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 13.323979591836734 |
|---|---|
| Minimum | 0 |
| Maximum | 29 |
| Zeros | 27 |
| Zeros (%) | 6.9% |
| Memory size | 3.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 6 |
| median | 11 |
| Q3 | 21 |
| 95-th percentile | 29 |
| Maximum | 29 |
| Range | 29 |
| Interquartile range (IQR) | 15 |
Descriptive statistics
| Standard deviation | 8.558786423 |
|---|---|
| Coefficient of variation (CV) | 0.6423596167 |
| Kurtosis | -1.027448284 |
| Mean | 13.32397959 |
| Median Absolute Deviation (MAD) | 5 |
| Skewness | 0.3061704322 |
| Sum | 5223 |
| Variance | 73.25282504 |
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 11 | 48 | 12.2% | |
| 6 | 47 | 12.0% | |
| 21 | 31 | 7.9% | |
| 9 | 28 | 7.1% | |
| 0 | 27 | 6.9% | |
| 26 | 26 | 6.6% | |
| 8 | 23 | 5.9% | |
| 29 | 22 | 5.6% | |
| 3 | 17 | 4.3% | |
| 22 | 16 | 4.1% | |
| 13 | 13 | 3.3% | |
| 14 | 12 | 3.1% | |
| 16 | 11 | 2.8% | |
| 18 | 10 | 2.6% | |
| 10 | 8 | 2.0% | |
| 20 | 8 | 2.0% | |
| 1 | 7 | 1.8% | |
| 28 | 6 | 1.5% | |
| 7 | 6 | 1.5% | |
| 19 | 4 | 1.0% | |
| 25 | 4 | 1.0% | |
| 24 | 4 | 1.0% | |
| 23 | 3 | 0.8% | |
| 15 | 3 | 0.8% | |
| 4 | 2 | 0.5% | |
| Other values (5) | 6 | 1.5% |
| Value | Count | Frequency (%) | |
| 0 | 27 | 6.9% | |
| 1 | 7 | 1.8% | |
| 2 | 2 | 0.5% | |
| 3 | 17 | 4.3% | |
| 4 | 2 | 0.5% | |
| 5 | 1 | 0.3% | |
| 6 | 47 | 12.0% | |
| 7 | 6 | 1.5% | |
| 8 | 23 | 5.9% | |
| 9 | 28 | 7.1% |
| Value | Count | Frequency (%) | |
| 29 | 22 | 5.6% | |
| 28 | 6 | 1.5% | |
| 27 | 1 | 0.3% | |
| 26 | 26 | 6.6% | |
| 25 | 4 | 1.0% | |
| 24 | 4 | 1.0% | |
| 23 | 3 | 0.8% | |
| 22 | 16 | 4.1% | |
| 21 | 31 | 7.9% | |
| 20 | 8 | 2.0% |
target
Categorical
| Distinct count | 3 |
|---|---|
| Unique (%) | 0.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.2 KiB |
| 2 | |
|---|---|
| 1 | |
| 0 |
| Value | Count | Frequency (%) | |
| 2 | 245 | 62.5% | |
| 1 | 79 | 20.2% | |
| 0 | 68 | 17.3% |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Most occurring characters
| Value | Count | Frequency (%) | |
| 2 | 245 | 62.5% | |
| 1 | 79 | 20.2% | |
| 0 | 68 | 17.3% |
Most occurring categories
| Value | Count | Frequency (%) | |
| Decimal Number | 392 | 100.0% |
Most frequent Decimal Number characters
| Value | Count | Frequency (%) | |
| 2 | 245 | 62.5% | |
| 1 | 79 | 20.2% | |
| 0 | 68 | 17.3% |
Most occurring scripts
| Value | Count | Frequency (%) | |
| Common | 392 | 100.0% |
Most frequent Common characters
| Value | Count | Frequency (%) | |
| 2 | 245 | 62.5% | |
| 1 | 79 | 20.2% | |
| 0 | 68 | 17.3% |
Most occurring blocks
| Value | Count | Frequency (%) | |
| ASCII | 392 | 100.0% |
Most frequent ASCII characters
| Value | Count | Frequency (%) | |
| 2 | 245 | 62.5% | |
| 1 | 79 | 20.2% | |
| 0 | 68 | 17.3% |
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.First rows
| MPG | cylinders | cubicInches | horsepower | weightLbs | time-to-sixty | year | brand | target | |
|---|---|---|---|---|---|---|---|---|---|
| 0 | 14.0 | 8.0 | 350.0 | 165.0 | 4209.0 | 12.0 | 1972.0 | 6 | 2 |
| 1 | 31.9 | 4.0 | 89.0 | 71.0 | 1925.0 | 14.0 | 1980.0 | 29 | 0 |
| 2 | 17.0 | 8.0 | 302.0 | 140.0 | 3449.0 | 11.0 | 1971.0 | 11 | 2 |
| 3 | 15.0 | 8.0 | 400.0 | 150.0 | 3761.0 | 10.0 | 1971.0 | 6 | 2 |
| 4 | 30.5 | 4.0 | 98.0 | 63.0 | 2051.0 | 17.0 | 1978.0 | 6 | 2 |
| 5 | 23.0 | 8.0 | 350.0 | 125.0 | 3900.0 | 17.0 | 1980.0 | 4 | 2 |
| 6 | 13.0 | 8.0 | 351.0 | 158.0 | 4363.0 | 13.0 | 1974.0 | 11 | 2 |
| 7 | 14.0 | 8.0 | 440.0 | 215.0 | 4312.0 | 9.0 | 1971.0 | 21 | 2 |
| 8 | 25.4 | 5.0 | 183.0 | 77.0 | 3530.0 | 20.0 | 1980.0 | 15 | 0 |
| 9 | 37.7 | 4.0 | 89.0 | 62.0 | 2050.0 | 17.0 | 1982.0 | 26 | 1 |
Last rows
| MPG | cylinders | cubicInches | horsepower | weightLbs | time-to-sixty | year | brand | target | |
|---|---|---|---|---|---|---|---|---|---|
| 382 | 17.6 | 8.0 | 302.0 | 129.0 | 3725.0 | 13.0 | 1980.0 | 11 | 2 |
| 383 | 19.0 | 3.0 | 70.0 | 97.0 | 2330.0 | 14.0 | 1973.0 | 14 | 1 |
| 384 | 36.0 | 4.0 | 79.0 | 58.0 | 1825.0 | 19.0 | 1978.0 | 23 | 0 |
| 385 | 33.0 | 4.0 | 105.0 | 74.0 | 2190.0 | 14.0 | 1982.0 | 29 | 0 |
| 386 | 25.0 | 4.0 | 113.0 | 95.0 | 2228.0 | 14.0 | 1972.0 | 26 | 1 |
| 387 | 25.5 | 4.0 | 122.0 | 96.0 | 2300.0 | 16.0 | 1978.0 | 21 | 2 |
| 388 | 21.0 | 6.0 | 155.0 | 107.0 | 2472.0 | 14.0 | 1974.0 | 16 | 2 |
| 389 | 11.0 | 8.0 | 318.0 | 210.0 | 4382.0 | 14.0 | 1971.0 | 9 | 2 |
| 390 | 17.0 | 6.0 | 163.0 | 125.0 | 3140.0 | 14.0 | 1979.0 | 28 | 0 |
| 391 | 36.0 | 4.0 | 105.0 | 74.0 | 1980.0 | 15.0 | 1983.0 | 29 | 0 |